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AMENDMENTS TO THE CLAIMS: 

This listing of clainns will replace all prior versions, and listings, of claims in the 
application: 

LISTING OF CLAIMS: 

1 . (Currently Anriended) A method of displaying files within a file system 
to a user in a semantic hierarchy, the method comprising the steps of: 

mapping the files in the file system into a semantic vector space; 

clustering the files within said space, wherein multiple threshold values that 
are settable to desired levels of granularity are defined, and said files are clustered 
based on said multiple threshold values; 

deriving a hierarchy of plural levels of clusters from said clustering; and 

providing a user an option o ^to selectively switch between displaying the files 
in a hierarchical format of plural levels of clusters based on said derived hierarchy, of 
and displaying the files in a hierarchical format based on locations of the files in the 
file system. 

2. (Original) The method according to claim 1 , wherein the step of 
clustering the files is performed as a background routine during the operation of a 
computer associated with said file system. 
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3. (Original) The nnethoci according to clainn 2, wherein the step of 
clustering the files is performed in response to the creation of a new file within the file 
system. 

4. (Original) The method according to claim 1 , wherein said files are text 
documents and said mapping is conducted on the basis of a language model. 

5. (Original) The method according to claim 4, wherein said mapping step 
comprises the steps of constructing a matrix which associates each word in the 
documents with a vector and associates each document with a vector. 

6. (Original) The method of claim 5, further including the step of 
decomposing said matrix to define the words and documents as vectors in a 
continuous vector space. 

7. (Original) The method of claim 5, wherein said clustering is performed 
by identifying documents whose vectors are within a threshold distance of one 
another. 

8. (Canceled) 

9. (Previously Presented) The method of claim 5 further including the step 
of automatically labeling the clusters based on the resulting clusters. 
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10. (Original) The nriethod of clainn 9 wherein said labeling connprises 
selecting representative words based on the closeness of their vectors to the 
document vectors in a cluster. 

1 1 . (Currently Amended) A non-transitory computer-readable medium 
containing a graphical user interface configured to display files belonging to a file 
system in a virtual file system with a semantic hierarchy of plural levels of clusters 
that is derived from semantic similarities of said files, clustering said files belonging 
to the file system based on multiple threshold values that are settable to desired 
levels of granularity, and determining a directory structure having plural levels of 
clusters based on the clustering determined from similarities between said files, 
wherein the graphical user interface provides a user an option of -to selectively switch 
between graphically displaying the determined directory structure having plural levels 
of clusters to b e d i sp l ay e d on a display device, oi^ and displaying the files in a 
hierarchical format based on locations of the files in the v i rtual file system. 

12. (Canceled) 

1 3. (Currently Amended) The non-transitory computer-readable medium 
graphica l us e r i nt e rface according to claim 1 1 , wherein in the graphical user interface 
clustering of the files is initiated by user selection. 
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1 4. (Currently Annended) The non-transitory connputer-readable nriedium 
graph i cal us e r int e rfac e according to claim 1 1 , wherein in the graphical user interface 
clustering of the files is initiated upon creation of a new file in the file system. 

1 5. (Currently Amended) The non-transitorv computer-readable medium 
graph i ca l us e r i nt e rface according to claim 1 1 , wherein in the graphical user 
interface, text files are clustered utilizing a language model and non-text files are 
clustered utilizing rule-based techniques. 

1 6. (Currently Amended) The non-transitorv computer-readable medium 
graphica l us e r i nt e rfac e according to claim 15, wherein in the graphical user 
interface, said language model comprises the LSA paradigm. 

17. (Currently Amended) Non-transitorv computer Comput e r readable 
media having stored therein computer executable code for analyzing files in a file 
system to determine similarities in data pertaining to their content, clustering said 
files in the file system based on multiple threshold values that are settable to desired 
levels of granularity, determining a directory structure having plural levels of clusters 
based on the clustering determined from similarities between the files, and providing 
a user an option of -to selectively switch between displaying files in hierarchical 
format of plural levels of clusters based on the clustering determined from similarities 
between the files, Gi^arid_displaying the files in a hierarchical format based on 
locations of the files in the file system. 
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1 8. (Currently Annended) The non-transitory connputer-readable nnedia of 
claim 17 wherein said files are text docunnents, and the sinnilarities are based upon 
the word content of the files. 

1 9. (Currently Amended) The non-transitorv computer-readable media of 
claim 18 wherein said similarities are determined in accordance with a language 
model, and the files are clustered in accordance with said model. 

20. (Currently Amended) The non-transitorv computer-readable media of 
claim 19, wherein said language model comprises the LSA paradigm. 

21 . (Currently Amended) The non-transitorv computer-readable media of 
claim 19, wherein said computer-executable code performs the steps of constructing 
a matrix which associates each word in the documents with a vector and associates 
each document with a vector. 

22. (Currently Amended) The non-transitorv computer-readable media of 
claim 21 , wherein said computer-executable code further performs step of 
decomposing said matrix to define the words and documents as vectors in a 
continuous vector space. 

23. (Currently Amended) The non-transitorv computer-readable media of 
claim 22, wherein said computer-executable code performs clustering by identifying 
documents whose vectors are within a threshold distance of one another. 
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24. (Canceled) 

25. (Currently Anriended) The non-transitory computer-readable media of 
claim 19, wherein said computer-executable code performs step of automatically 
labeling the clusters based on the resulting clusters. 

26. (Currently Amended) The non-transitorv computer-readable media of 
claim 25, wherein said labeling comprises selecting representative words based on 
the closeness of their vectors to the document vectors in a cluster. 

27. (Currently Amended) The non-transitorv computer readable media 
according to claim 17, wherein the computer executable code performs the following 
steps: 

clustering text files within the file system using semantic similarities; 
clustering non-text files within the files system using rule-based techniques; 
labeling the resulting clusters; and 

displaying the files in a hierarchical format based on the resulting clusters and 

labels. 

28. (Currently Amended) A computer system, comprising: 
a file system storing files; 

a display device; 
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a processor for analyzing the content of files stored in said file systenn to map 
said files into a semantic vector space, cluster the files within said space based on 
multiple threshold values that are settable to desired levels of granularity, and derive 
a hierarchy of plural levels of clusters from said clustering; and 

a user interface which provides a user an option of -to selectively switch 
between displaying files stored in said file system in the form of said derived 
hierarchy of plural levels of clusters, of ^and displaying the files in a hierarchical 
format based on locations of the files in the file system. 

29. (Canceled) 

30. (Previously Presented) The computer system of claim 28, wherein said 
files are text documents and said processor maps said files on the basis of a 
language model. 

31 . (Original) The computer system of claim 30 wherein said processor 
constructs a matrix which associates each word in the documents with a vector and 
associates each document with a vector. 

32. (Original) The computer system of claim 31 wherein said processor 
further decomposes said matrix to define the words and documents as vectors in a 
continuous vector space. 
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33. (Original) The connputer system of clainn 31 , wherein said processor 
clusters the files by identifying documents whose vectors are within a threshold 
distance of one another. 

34. (Canceled). 

35. (Previously Presented) The computer system of claim 31 , wherein said 
processor automatically labels the clusters based on the resulting clusters. 

36. (Original) The computer system of claim 35 wherein said processor 
labels the clusters by selecting representative words based on the closeness of their 
vectors to the document vectors in a cluster. 

37. (Previously Presented) The method according to claim 1 , wherein said 
deriving step includes organizing the clusters into a hierarchical directory structure. 

38. (Previously Presented) A method of organizing a plurality of 
documents in a file system, comprising: 

mapping all words of the plurality of documents in the file system and the 
plurality of documents in a semantic vector space; 

generating a plurality of clusters based on the semantic similarities of the 
plurality of documents and multiple threshold values that are settable to desired 
levels of granularity; 
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organizing the plurality of clusters into directories in a hierarchical format of 
plural levels of clusters; and 

providing a user an option of displaying the plurality of documents in said 
hierarchical format of plural levels of clusters based on a result of clustering the 
plurality of documents, or displaying the documents in a hierarchical format based on 
locations of the documents in the file system. 

39. - 47. (Canceled) 

48. (Previously Presented) The method of claim 1 , wherein the multiple 
threshold values are characteristic values of clusters from said clustering. 

49. (Previously Presented) The method of claim 48, wherein the 
characteristic values of the clusters are cluster variances of the clusters. 

50. (Currently Amended) The graphica l us e r i nt e rfac e non-transitorv 
computer-readable medium according to claim 1 1 , wherein the multiple threshold 
values are characteristic values of clusters from said clustering. 

51. (Currently Amended) The graph i ca l us e r i nt e rfac e non-transitorv 
computer-readable medium according to claim 50, wherein the characteristic values 
of the clusters are cluster variances of the clusters. 
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52. (Currently Annended) The non-transitory computer-readable 
nnedia of claim 17, wherein the multiple threshold values are characteristic values of 
clusters from said clustering. 

53. (Currently Amended) The non-transitory computer-readable media of 
claim 52, wherein the characteristic values of the clusters are cluster variances of the 
clusters. 

54. (Previously Presented) The computer system of claim 28, wherein 
the multiple threshold values are characteristic values of clusters from said 
clustering. 

55. (Previously Presented) The computer system of claim 54, wherein the 
characteristic values of the clusters are cluster variances of the clusters. 

56. (Previously Presented) The method of claim 38, wherein the 
multiple threshold values are characteristic values of clusters from said clustering. 

57. (Currently Amended) The comput e r method of claim 56, wherein the 
characteristic values of the clusters are cluster variances of the clusters. 

58. (Previously Presented) The method of claim 1 , further comprising 
providing a user an option to reorganize the files in the file system according to the 
derived hierarchy. 



